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Abstract 

We introduce a Gibbs measure on nearest-neighbour paths of length t in the Euclidean 
planar lattice, where each path is penalized by a factor proportional to the size of its 
boundary and an inverse temperature (3. This measure gives a random walk description 
of the Wulff crystal, representing the distribution of a diluted polymer in a poor solvent. 
We prove that in two dimensions, if the inverse temperature j3 is larger than the logarithm 
of the connective constant, then the random walk condensates to a set of diameter i 1 / 3 , 
up to logarithmic corrections. We further speculate that the limiting shape shares the 
same exponents as the KPZ universality class. A similar result holds for a random walk 
conditioned to have local time greater than /3 everywhere in its range. 
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Figure 1: Simulations using a Gibbs sampler algorithm of a random walk cluster with t = 
25,000 steps, corresponding to four different values of (3: f3 = 0.01, /3 = 0.1, (3 = 1, and f3 = 2. 
Pictures realised by Tom Begley. 
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1 Introduction 

1.1 Statement of the main results 

Let d > 1 and let f2 be the space of nearest-neighbours right-continuous infinite paths (ut, t £ 
[0, oo)) on Z d started at ujq = 0, and let (Xt(co),t > 0) be the canonical process. Let P denote 
the law of simple random walk on (17, T) in continuous time with rate one, where J- denotes 
the a- field generated by X. 
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Our main result deals with geometric properties of some penalizations of random walks 
on Z d by their boundary. More precisely, we introduce a Gibbs measure fi on random paths 
defined as follows. Let Rt = {v G 7L d : X s = v for some s < t} denote the range of the walk 
at time t. For a given time t, we consider the Hamiltonian H given by 

H(lj) = \dR t \, (1.1) 

where for a set G, dG = {x £ G : x ~ y for some y ^ G} is the (inner) vertex-boundary of 
G. The associated Gibbs measure on random paths /j, = [it is obtained by considering the 
measure \i defined by 

^( W ) = lexp(-/3#(a,)) (1.2) 

on T . Here f3 > is a positive number playing the role of inverse temperature and Z = 
Z{f3,t) = E(exp(— (3\dRt\)) is a normalizing factor called the partition function. In plain 
words, the Gibbs measure fj, penalizes every site on the boundary of the range Rt by a fixed 
amount e~@ . Hence [i favours "highly condensed" configurations. Interpreting the random 
walk (Xq, . . . ,Xt) as a chain of t monomers, the Gibbs measure /i represents the law of a 
diluted polymer in a poor solvent. With a constraint on the maximal volume and a boundary 
penalization, the random set Rt may be seen as a random walk construction of the Wulff 
crystal droplet with inverse temperature (3. The Wulff construction ([23]) is a method to 
determine the equilibrium shape of crystals based on surface energy minimization. In the 
percolation context, rigorous derivation of the limiting shape were first given by Alexander, 
Chayes and Chayes [2] in two dimensions, while in the context of the Ising model, this was 
achieved in a nearly simultaneous and independent way in a celebrated work of Dobrushin, 
Kotecky and Shlosman [8]. See [4] for a recent monograph giving a detailed overview of the 
subject. 

We will be interested in describing the geometry of the shape of Rt in the low temperature 
regime, where (5 is sufficiently large. Our main result gives precise estimates for the condensa- 
tion effect that results from the self- interaction in dimension 2. If G C Z 2 , let diamG denote 
the (Euclidean) diameter of G: 

diamG = sup{|z — w\,z, w £ G}. 

Theorem 1.1. Assume d = 2. Let a be the connective constant of J? and let (3q = log a. 
Then for all (3 > (3q, 



(1.3) 



as t — >■ oo, where £i(t) = ci(logi) 5//2 and £%(t) = C2(logt) 5 and the constants c\,C2 depend on 
(3 only. 

Remark 1.2. The exponent 1/3 in Theorem \l.l\ is in agreement with the physics predictions, 
which asserts that the radius of gyration has exponent v = 1/3. 

The proof gives precise estimates on the probability of these events, as well as estimates 
on the partition function Z = Z(/3,t). We refer the reader to Theorem 13.61 for a precise 
statement, and refer to Open Problem [3] and the following paragraph for a precise conjecture 
on the geometry of Rt when d = 2. 



3 



A related problem was studied by E. Bolthausen [3]. In that work, the energy H(uj) 
serving to define the Gibbs measure \x in (jl.2p is taken to be H(uj) = \Rt(u)\, the size of the 
range (as opposed to that of its boundary). Thus djl = Zf 1 exp(— j3H(u)))dF. Bolthausen's 
result is that the random walk condensates to a set of diameter i 1//4 , which is close in the 
Hausdorff sense to a Euclidean ball of that diameter. In both problems, good bounds on the 
partition functions Z t and Z t play a crucial role. In the case where the energy is just the 
volume, we have Z t = E(exp(— j3\Rt\)), and precise asymptotics for this quantity were already 
obtained by Donsker and Varadhan [13j. This is a considerably easier problem than the one 
considered here, essentially because \R t (uj)\ is "almost" a continuous function of its local time 
profile, viewed as a probability measure on Z d . In particular, the powerful machinery of large 
deviations theory provides the right tools to study this question. This goes a long way in 
explaining the appearance of the Euclidean ball as a limit shape, and explains why the inverse 
temperature (3 is not a relevant parameter in that model. 

In contrast, here we believe that the limit shape depends on /3 and is not a rotationally 
symmetric ball, and may even contain some flat faces. So the microscopic geometry of the 
lattice is important even to determine the macroscopic shape of the random walk cluster, 
and thus there is no hope in directly applying the large deviations machinery to the problem. 
Instead, a direct and delicate quantitative control of the Markov chain obtained by performing 
the right change of measure is needed. This is the main technical contribution of this paper. 



1.2 Variants 

Our technique is sufficiently robust that it yields similar results for a number of models which 
turn out to be quite closely related. 

A first variant consists in taking a slightly different Hamiltonian H, defined by 

H(u) = L ^)< ( L4 ) 

xedRt 

Thus the penalisation takes into account not only the size of the boundary, but also the 
amount of time spent on it. Define djl = {Z)~ l ex.p(—(3H)dF on T . 

Theorem 1.3. Theorem \l.l\ still holds true with jj, instead on fi. 

A second variant which we consider looks quite different superficially. Define the event 

St = {Lt(x)>P,Vx€Rt}. (1.5) 

This gives a lower-bound on the density of local time uniformly over the range Rt- This also 
favours highly condensed configurations. As we will see later on (see for instance (|3.14p ). the 
conditioning also heavily penalizes shapes Rt with large boundaries: essentially, every point 
on the boundary penalizes the shape by a factor of order e~@, and so we expect a similar 
behaviour to Theorem 11.11 Indeed, we obtain the following results: 



Theorem 1.4. Assume d = 2 and (3 > j3\ for some (3\ < oo. Then 

P ( JJff 1 * ^ diam(^) < t 2 {t)t 1 /* -»• 1 (1.6) 

as t oo. 
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Note that by the isoperimetric inequality, the volume \Rt\ is at most t 2 ^ 3 £2(t) 2 , so the 
typical local time at a point will be of order i 1 / 3 under P(-|£^). This is quite surprising, since 
in the absence of any conditioning, the local time at a typical vertex is of order logt. Thus 
requiring a (finite) uniform lower bound (3 on the local time of each vertex of Rt makes the 
actual value of the local time at a typical vertex jump from logt to i 1 / 3 . 

Acknowledgements. This work started when Ariel Yadin was a Herschel Smith postdoc- 
toral fellow at the Statistical Laboratory, University of Cambridge. We gratefully acknowledge 
the financial support of the Herschel Smith fund and EPSRC grant EP/GO55068/1, without 
which this work would not have been possible. We thank Itai Benjamini for raising the ques- 
tion of the random walk in Theorem 11.41 which eventually led us to this work. Part of this 
work was carried out while the first author was visiting Microsoft Research. He is grateful to 
Yuval Peres for his invitation and interesting related discussions. We also thank Ofer Zeitouni 
for some very useful conversations regarding large deviations and the link between our work 
and [6j . We thank Tom Begley for allowing us to use the pictures in Figure [TJ 

1.3 Notation 

Let d > 1 and let be the space of nearest-neighbours right-continuous infinite paths (ut, t G 
[0, oo)) on Ij d and let (X t (uj),t > 0) be the canonical process. Let P denote the law of 
simple random walk on (£l,J-) in continuous time with rate one, where T denotes the a- field 
generated by X. 

A conductance or weight function is a non- negative function c on the directed edges of 
that is symmetric (so actually a function of the un-directed edges), such that the total weight 
at any vertex is finite: 

c x = } y c(x, y) < oo. 

y^x 

The canonical continuous time Markov process is that with transition rates given by c(x,y). 
P£, E£ denote the probability measure and expectation of this process with respect to a weight 
function c and started at Xq = x. When omitting the subscript, we refer to x = 0. The weight 
function may also be omitted when it is clear from the context. 

If e = (x, y) is an (oriented) edge of % d and <j) a (real-valued) function on its vertices, 
denote V0(e) = V(j)(x, y) = 4>{y) — 4>{x) the discrete gradient, and 

£(<f>, ^)=J2 V< ^( x ' tfWOr, y). (1-7) 

x~y 

£(<f>, 4>) is the Dirichlet energy of the function <j). We also use the Dirichlet energy in a set S: 

£s{4>, i>) = ^2 WO^^WO^y)- 

xr^y 
x,y&S 

Let also 

£f(x) = J2Vf(x,y) 

y~x 

denote the discrete Laplacian on 7L d . This is a self adjoint operator with respect to the inner 
product 

X 

That is, (AM) = (<f>,Ci/)). 
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2 Change of measure estimates 



2.1 Statement of general theorem 

We start by outlining our strategy. A key difficulty in Theorem 1 1.1 1 is to obtain good bounds, 
and in particular lower bounds, on the partition function Z% = E(exp(— /3\dRt\)). Essentially, 
the main contribution to this expectation comes from configurations where the typical diam- 
eter is of order i 1 / 3 , and the range has a boundary size which is not much larger (a log factor 
away). To get a lower bound on the partition function, it suffices to estimate the probability 
of such events. Thus we will fix a square S of size approximately t 1 ^ 3 , and we will estimate 
the probability that the random walk stays in S and that all but a handful of vertices are 
visited. The main difficulty arises from the need to ensure that all vertices are indeed visited. 
This is delicate, especially near the boundary, since each time the walk visits the boundary 
it shouldn't leave the square. Hence we require that sites near the boundary are visited, but 
typically only a finite number of times. In order to achieve this, we carefully choose a profile 
n(x), where ir(x) > only in the square S and ^s ,7r ( x ) = 1> anc ^ require that the random 
walk has a local time profile approximately given by ir. We then prove precise quantitative 
estimates concerning the deviation of the walk from this profile. 

We now introduce our framework. Throughout this section we fix a non-negative function 
7r : Z d — > R and call S = {x G Z d : ir(x) > 0}. We make the following assumptions on it and 
S: there exists L > and some constants Cj > independent of L such that: 

(Bl) 7r is normalised: c\ < X^es 71 "^) — c 2> an d max x 7r(x) < cs/L d 

(B2) For every z G S, there exists a cone C z C S such that z is the apex of C z , diam(C z ) > C4L 
for some constant C4 independent of z, and the angle of C z is uniformly bounded below 
by some a > 0. For all z G S and all w £ C z , 

tt(w) > c 5 ir(z). (2.1) 

(B3) There exists for each pair of points x, y G 5 a fixed path 7^ C S so that \~f X y \ < L and 
for every edge e = (e~,e + ) G jxy, 

vr(e ± ) >c 6 (vr(x) Avr(y)). (2.2) 

Furthermore, for each edge e, 

#{x,yGS : ee lxy }<c 7 L d +\ (2.3) 

(B4) There exists eg > and eg such that min2;7r(x) > cs/L Cg . 
(B5) There exists c%q such that ir(x)/ir(y) < cio if x ~ y. 

Note that since \ j X y\ < L for every pair x,y G S", we have L > diam(S'). In practice, we 
will often choose L = diam(S'). Under these assumptions we will prove the following result. 
Let f(z) = \J tt(z), and define the two following regions: 

S_ = {z G S : Cf(z) < 0}, S + = {z G S : Cf(z) > 0} 
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These are the domains where / is respectively subharmonic or super harmonic. We will prove 
the following theorem, which is in fact a bit more than needed. Define the event 

T> a ,fi = {air(x) < < (3tt(x),x £ Z d }. 

Let Q be the Markov chain with transition rates Q(x,y) = f(y)/f(x) if f(x) > 0, and 
Q(x, y) = else. Let Z t = d¥ /dQ\j- t denote the Radon-Nikodyn derivative of P with respect 
to Q. Then we have the following result. 

Theorem 2.1. Assume (B1)-(B5). Then almost surely on V a ^, 

P eM~tS(f,f))) <Z t < p -i e xp(-|^(/,/)). (2.4) 
Here the error term p is given by 



p = J^L eX p(-t(/? - «)$(/)), with *(/) = f( x )^f(x, V)- 

V ^max 

In particular, if f is subharmonic, then &(f) = and p 



x£S+,y£S- 



TTmax 

Moreover, for all x,z G S, Q x {Ty > t) < exp(— ct7r(y)/(logL) 3 ), and 

Q x (\L(t,y)-n(y)t\>6n(y)t)<eM-cP^)- (2-5) 

(log L) b 

Note in particular that 

apeM~tS (/,/))) < P x {D a>p ) < p- 1 exp(-|^(/,/)) (2.6) 

where 

cr = 1 — C > exp( — c- —r^)- 

where 5 = min(|a — 1|, |/3 — 1|). Thus for fixed a,/3, we have a > 1/2 as soon as tmm y ir(y) > 
C(logL) 7 for some constant C. 

The main idea for the proof of the theorem is to make a change of measure argument 
so that under the new measure, the random walk has a stationary distribution proportional 
to if. Roughly speaking, the role of (B3) is to ensure that the spectral gap under this new 
measure in D is at least 1/L 2 . On the other hand, the cone condition (B2) ensures that 
locally the the random walk under this new measure is not "too recurrent", so that it will 
have at time t a local time profile close reasonably to its limiting value (which is precisely the 
desired stationary measure, proportional to /). 

2.2 Canonical paths 

Lemma 2.2. Let Q be a matrix of rates for a continuous time Markov chain (X t ), on finite 
state space S with measure Q. Suppose that for some probability vector ir we have that 
ir(x)Q(x,y) = ir(y)Q(y,x); that is, ir is in detailed balance with Q. 



7 



Let ( r y X y)x,yes be any collection of paths in S, one for each pair of vertices. Define 
6 = e(Q,7r) = max <x) Q { ^ y) E <zMw)\ lzw \. 

Then, for any e > 0, 

Wx(e) < - log (emin7r(x)) • 9, (2.7) 



where 

Wx(e) = mr < t > : sup 



7r(y) 



1 



< e 



Remark 2.3. Note that we use a slightly different version of mixing time that the usual 
total-variation mixing. It turns out to be more convenient for our purposes. 

Proof. We follow the canonical paths method by Diaconis and Saloff-Coste, see [191 Chapter 
'•• • 

Let D be the diagonal matrix with D(x, x) = J2 y Q( x -> v)- So 

Q x [X t = y]=e-^ D -^(x,y). 

Let II be the diagonal matrix with tt on the diagonal. 

The main point is that because tt is in detailed balance with Q, we get that D — Q 
is self adjoint with respect to the inner product (IL,-), which implies that there exists an 
orthonormal basis <f>i, . . . , (j)\g\ of eigenvectors for D — Q with corresponding eigenvalues Ai < 
X 2 < ■ ■ ■ < Aim. Moreover, we have Ai = and </>i = 1. 

Some short computations as in [191 Chapter 12] yield that for any x, 



sup 

x,y 



UX t = y] 1 



1 



ir(y) mm x ir(x)' 

Also, the min-max characterization (or Dirichlet form) gives 

. (U(D-Q)ip,^) 
A 2 = mm — r . 

(n^, <p) 

(tp,ir)=Q 

So, with the same proof as in the comparison method of [191 Chapter 13], an application of 
Cauchy-Schwarz gives that for any vector <p ^ with (ip, tt) = 0, 

<P<p,<p)<Q-(IL(D-Q)<p,<p). 

Minimizing, we conclude that 7 > _1 which proves the lemma. □ 

Lemma 2.4. Let Q be a matrix of rates for a continuous time Markov chain (Xt) on finite 
state space S, with measure Q. Let T x = ini {t > : X t = x}. Then, for any t > r := 
2 Wx(l/2), uniformly in x,y G S, 

} x [T y >t]< exp ' 



4E y [L(r,y)} 
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Proof. Fix x,y £ S. Note that for any t > t m ; x (l/2) we have that 

®x[X t = y]>^. 

We have 

Q x[H r,y) > 0] = > jLmgj^g > W(V2M y) 

V [ 1 J E x .[L(r,y)|L(r,y)>0] " E,[L(r,y)] " 2E y [L(r,y)] 

Note that T y > t if and only if L(t, y) = 0. Thus, because the above estimate is uniform in 

x G S, 

QxPv >*] < 1- ^ riv. „.m ) <exp' 



2E„[L(r,y)] 7 " " V 4E y [L(r,y 
as claimed. □ 

2.3 Flows and hitting estimates 

Recall the assumptions (B1)-(B5). Given the measure ir in these assumptions, we define the 
continuous time Markov chain (Xt)t on 5, with law Q, to be the chain whose transition rates 
are Q{x,y) = ^/ir(y)/n(x)l^yy. 

A few observations regarding this setup: 

Lemma 2.5. For the chain with law Q defined above, we have that for some constant C > 
independent of L, 

WV2) < CL 2 log L. 

Proof, tt is in detailed balance with Q. 

For the collection of paths given in (B3), we have that for some constant C > independent 
of L, 

6 = 8(Q, tt) = max * V vr(z)7r(u;)|7^| 

a=~J/ Tv{x)Q{x,y) 

(c 6 ) 2 (tt(x) Avr(y)) 2 r „ . . , 

< max y ' \ yjJ ■ L • #{z, weS : (x,y)£ lzw } 

7r(x) A 7r(y) 

< c 7 (c 6 ) 2 L (i+2 max7r(x) < CL 2 , 

X 

where the last inequality follows from (Bl). Using Lemma 12.21 and (B4) we conclude the 
proof. □ 

In order to bound the large deviations of this chain, we will find it useful to bound the 
expected local time accumulated up to time r := 2 t m i x (l/2). This is the content of the next 
lemma. This lemma is also the main use of the cone condition (B2) above. 

Lemma 2.6. Let r := 2t m j x (l/2). For the chain with law Q defined above, we have that for 
some constant C > independent of L, for any y £ S, 

E y [L(r,y)} <C (log L f. 
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Proof. Fix y £ S. Let S m = {z £ C y : dist(z,y) = m} (the distance is the graph distance 
in Let £ = S m for the largest m that is at most dian^Cj). Note that this m satisfies 
m > c 4 L by (B2). 

Step 1. First, we will bound from below the probability that the Markov chain started 
from y hits £ before returning to y. Recall that our Markov chain with law Q is reversible. 
Therefore it is equivalent to a random walk on a network on S where the weight, or con- 
ductance, of the edge e = (z, w) being given by ir(z)q(z, w) = \Jk(z)it(w). It is equivalent 
in the sense that both processes visit the same points in the same order, though possibly at 
different times. Hence it suffices to bound from above the effective resistance TZ e s(y —> S) in 
this network. Using (|2.ip in (B2) above, vr(z) > c^ir(y) for all z € C y . 

If we change the weights on all edges in C y to be c^Ti(y) and the weights on edges not in 
C y to be 0, then we have only reduced the conductance of all edges. Rayleigh's monotonicity 
principle (see [201 Chapter 2.4]) tells us that the effective resistance only increases. So it 
suffices to bound from above the effective resistance between y and £ in this modified network. 

The approach we use is that of [20, Chapter 2]. Let U be a random variable uniform on 
the base of the cone C y at distance L from y. Let R be the ray from y to U. Let T be a 
(random) monotone path that stays at distance at most y2 from R, until reaching £. It is 
easily calculated that for an edge e at distance k from y in C y , the probability that e G T 
is at most 0{k~ <yd ~ 1 ^). Also, the number of edges in C y at distance k from y is at most 
0{k d - v ). Finally, because T is chosen to be monotone, it traverses any edge at most once. 
So the function 6(e) = P[e € T] — P[e £ T] defines a unit flow in the cone C y , from y to £ 
(again see [20l Chapters 2.4 & 2.5]), where e is the directed edge e in the reverse direction. 
Thompson's principle ([20, Chapter 2.4]) then implies that the energy of this flow bounds the 
effective resistance: 

TZ cS (y -»■ S) < Tres(e)8(e) 2 < V ■ 0(k- 2 ^) • 0(k d - 1 ) < ■ O(logL). 

Thus, if we define 

T A = inf {t > : X t € A} , 
T+ = mf{t>T s \ {y} : X t = y} , 
to be the hitting time of sets and return time to y, then for w y = ^2 Xr ^ y \/ n(x)ir(y), 

Q y [T s < T+] = - 1 > (clogL)" 1 , (2.8) 

y w y K eS (y S) 

using (B5). 

Step 2. We now provide a complementary bound for the tail of T^. Let (Y n ,n > 0) 
denote the jump chain of (Xt,t > 0), and let fi(x) = n(x)q x denote the invariant measure 
for Y; note that fi is then reversible for Y. Thus the Varopoulos-Carne bound ( [20\ Theorem 
13.4]) states that 

Q-AYn = z] < 2 • VM^)M*)exp f gffeg)? 
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Since dist(y, S) = m > cL, 

n 

Qy[Tz{Y) < n] < ®yl Y k e S] 

fc=m 

n 

< 2 • y/fi(z)/iJL(x) ■ exp(-cL 2 /2fc) 

zg£ k=m 

So with n = [cL 2 (log L) _1 J for small enough constant c > 0, 
Qy[T s (Y) < n] < C ^ y/n(z)/[i(x) ■ exp(-cL 2 /n) = C ^ y/K^/K^) exp(-c" 1 log L). 

On the other hand, using our assumptions (B1)-(B5) it is clear that ^ze£ V M 2 ) I '^i x ) — CL a 
for some a > 0. Thus choosing c small enough we have that 

Qy[Tz(Y) > n] -> 1. 

Since c < q x < d for some c, c' > and for all x G <S, we deduce that the total number of 
jumps of the jump chain by time t is at most a Poisson random variable with mean c't. Hence 
for some c > 0, for L large enough, by easy deviations for Poisson random variables, 

Q^Ts >cL 2 (log L)- 1 ] > 1/2. (2.9) 

Step 3. Let H(s,t) = H y (s,t) be the number of distinct visits to y in the time interval 
[s,t); that is, the number of visits that its jump chain, restricted to jumps occurring in the 
interval [s,t), pays to y during that time. We conclude from (12, 8p that 

E,[#(0,T S )] = 1 + < clogL. (2.10) 

We now use (|2.10p to estimate K y [L(T,y)]. Define Sq = 0,Tq = and inductively, 

Sj = inf {t > Tj^ x : X t = y} and 2) = inf {t > Sj : I ( eS}. 

These are the successive times for which the chain hits y then £ and then y again, etc. The 
strong Markov property applied to (|2.10p gives that for all j, E y [H(Sj, Sj+i)] < clogL. 

Let J = max{j : Sj < 2i m i x (l/2)}. The events ({£j+i — Sj > cL 2 (log L)~ x })j are in- 
dependent, and each has probability at least 1/2 by (|2.9p . So E y [J] < c • T L . Thus, 

oo oo 

E y [H(0,r)} < J2^y[H(Sj-i,Sj)l {s ._ 1<T} } = (S^, Sj)} ■ Q v [Sj-! < r) 

i=i i=i 

<clogL-E y [J] <c- r(1 °^ 2 L) . 
Together with Lemma 12.51 we have that 

E y [H(0,r)} <c(logL) 3 . 
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Now, given that H(0, r) = k, the local time at y is at most the sum of k independent 
exponential random variables, each of mean Q2 Z Q{Vi z ))^ 1 = \ \/ 7r ( ;Z )/ 7r (y) ) ■ Since 



there exists z E C y with Q(y, z) = y / TT(z)/ir(y) > yfcs (by (|2.1|) in (B2)), we have that 

E y [L(T,y)] < E y [H(0,r)} • (eg)" 1 / 2 < c(logL) 3 , 

as desired. □ 

We first derive the following corollary: 

Corollary 2.7. Let (Xt) be a Markov chain with transition rates given by Q. Let T y = 
inf{i > : Xt = y}. Then uniformly over all x £ S, 



where q^Tf] is a uniform constant. 
Proof. By Lemma 12.41 we have 

tn(y) 



(T v >t)< exp ( - ^ n3 ) . (2.11) 



\ x (T y >t)< exp 



AE y (L(r,y))J ' 

By Lemma \Z6\ E y (L(r,y)) < C(logL) 3 , so the result follows. □ 
2.4 Concentration estimates for local time 

The next lemma is an auxiliary lemma regarding exponentially decaying random variables. 
Lemma 2.8. Suppose that T is a non-negative random variable with 

¥[T >t]< e-i* 
for all t > and some fi > 0. Then, for any a < 2~ 1 / 2 fi, 

E[e^]<l + aE[r] + 2(e+ 2 1)a2 . 

M 

Proof. We use the inequality e x < 1 + x + x 2 valid for all x < 1 (including negative x). This 
immediately implies the result for a < 0. If < a < //, then we have 

E[e aT ] < 1 + qE[T] + q 2 E[T 2 ] + E[e aT l {aT>1} ]. 

Clearly, 

r°° 2 

E[T 2 ] < / 2tF[T > t]dt < -x. 
Jo A* 

The last term can be bounded as follows: 

/*00 /"OO 

E[e aT l {aT>1} ] = eP[T > 1/a] + / P[e aT l {aT>1} > t]dt = e 1 '* + / ae at F[T > t]dt 

Je Jl/a 

< e l-^ +a / e (^)t dt = (l + _^). e l-l 

Jl/a (i — a 
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Moreover, for any x > \[2 we have that 

(x - l)e x >x 2 -l + ^- ^>4- 

Thus, using x = fJ,/a, as long as \[2a < (J,, we have 

E[e aT ] < 1 + qE[T] + 2x~ 2 + , eX . < 1 + aE[T] + 2x~ 2 + 2eaf 

e x (:c — 1) 



□ 



We now use the above results to deduce the following key step in the proof, which is an 
exponential concentration of the local time at a given point. 

Lemma 2.9. Let (Xt)t be a continuous time Markov chain on finite state space S with 
transition rates Q(x,y) = \J ir(y)/ir(x) l^ x ^ y y for it satisfying assumption (B1)-(B5) above. 
For any 5 > 0, uniformly in x,y 6 S, 

® x [L(t,y) > (1 + S)ir(y)t] < Cexp f-c^^^j + Cexp (-<g$7r(y)t) , (2.12) 

and for0<8< 1/2, 

Qx[L(t,y) < (1 -SMy)t] < Cexp L^^^j . (2.13) 

Proof Fix y £ S. In this proof it is convenient to define a time A by putting 

A = ^° gL)3 . (2.14) 

((j22]is the constant from Corollary 12.71 ) Intuitively, y is hit with positive probability (which 
can be made as high as desired) at least every A units of time by Corollary 12.71 Note that 
since ir(y) < c/L d , A > r = 2 i mix (l/2) by Lemma [23J 

We start with the proof of the lower bound on L(t, y), (|2.13p . which is very similar except 
slightly harder than (|2.12p . Let N(t) denote the number of distinct visit to y by time t. Let 
q y = Ylx~ y Q{Ui x ) be the total jump rate from y under Q. 

Let n = \ir{y)q y t(l — 8)~\. Note that by assumption (B5), c < q y < d for some absolute 
constants c,d. Moreover, if L(t,y) < ir(y)t(l — 25), then either N(t) < n or Y27=i &i — 
ir(y)t(l — 25) < ir(y)t(l — 5) 2 < (1 — 5)n/q y , where Ei are independent exponential random 
variables with rate q y . The exponential moment of Ej is E[e _a£; -'] = - 9 ^_ a for all a > 0. So 

an application of Chebyshev's inequality to the non-negative random variable e~ a ^ Ej gives 



i=l 

Optimizing over a we obtain 



^^<(l-5)n/ % ]<(^-- e M^) 



n 

E E l <(l- 5)n/q y ] < ((1 - 5)e- 5 ^j " = exp (-c5ir(y)t) . (2.15) 
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To bound the other term, it will be useful to define 

T = inf{i>0 : X t = y} T + = inf {t > T s \ {y} : X t = y) , 

which are the hitting and return time to y, and also the successive return times to y: Tq = T, 
and for k > 0, 

S fc = inf{i>r fc _i : X t ^y} T k = M{t>S k : X t = y} . 

Clearly N(t) < n if and only if T n > t. T n is the sum of the independent increments Tj —Tj-\. 
By Corollary 12.71 each of these increments has an exponential tail Q x [Tj > 3}_i +t] < e~ A lf . 
Thus, Lemma ES] (with fi = A" 1 ) gives that for any < o. < ; 

E a [e a(T '' -T '- l) ] < 1 + Ea.[Tj - T^i] + 2(e + l)a 2 A 2 < exp (a^p) - Tj_i] + 2(e + l)a 2 A 2 ) . 

Note that E^ [T n ] = nE y [T + ] + (E X [T + ] - E 2/ [T+]). By Corollary E3 the second term is at 
most E X [T+] -E y [T + ] < A. Also, it is well known that E y [T+] = (^(y)) -1 . So Chebyshev's 
inequality gives 

Q x [N{t) <n} = Q x (T n >t)< E x (e aT ")e~ at 

< exp (at(l -5)+aA + 2(e + l)a 2 A 2 (n + 1) - at) . 

Choosing a = 5/(ir(y)q y A 2 ), which is essentially optimal, we note that < a < for L 
sufficiently large, hence 

QxWt) <n]< Cexp Lc^^^j (2.16) 

after simplification. Combining (|2.15p and ()2. 16[) gives (|2.13p . 

The proof of (|2.12p is similar. Here we choose n = [ir(y)q y t(l + 5)\. If L(t,y) > (1 + 
5) 2 ir(y)t then either N(t) > n or Ya=i E i > K{y)t(l + 5) 2 > (n/q v )(l + 5). Simlarly to the 
argument above, 

n 

P£ Ei > 7r(y)t(l + S) 2 } < e^^K 
i=l 

Also, by Lemma 12.81 and Corollary 12. 7\ as before, 

Qx[N(t) >n}< q x [T n <t}< E[e- aT -] ■ e at 

< Cexp {-at(l + 5) + 2(e + l)o 2 A 2 n + at) 

/ 5 2 7r(y)r 

< Cexp -c- —7 

V (log L ) f 

This completes the proof, continuing as in the lower bound. It is important here that we do 
not restrict ourselves to 5 < 1, so for large 8, the bound e~ c&1T ^ 1 kicks in. □ 
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2.5 Change of measure 

The following analogue of the Gauss-Green formula will be useful. 

Lemma 2.10. Let (f>,ip be two given functions with support contained in a finite set S. For 
any D C S, 

Y J 4>{x)mx) = -\£ D {(j ) ^)+ <Kx)Vil>{x,y). (2.17) 

(This is the analogue of the classical Gauss-Green formula f fAg = — J VfVg valid for 
smooth functions f,g with compact support, but note the additional boundary terms.) 

Proof. We have 

£d(4>,iP) = Y V(j)(x,y)Vif}{x,y) = Y (j)(y)Vip(x, y) - Y (p{x)V^{x,y) 

x^y£D x^ydO x^ydD 

= -2^0(x)£^(x) + 2^ Y, <f>(x)Vil>(x,y). 

xeD xeD D$y~x 

□ 

The crucial change of measure calculation is provided by the following lemma. 

Lemma 2.11. Let tt be some probability vector and let S be a connected subset of Z d con- 
taining the support of it, {it > 0} C S. Let Q be the law of the continuous time Markov chain 
with transition rates Q(x,y) = y ir{y)/'K(x) l^ xr ^ y y . 

Let Z t denote the Radon-Nikodym derivative off with respect to Q ; restricted to Ft: 



dF 

A t ■— "TFT 



(2.18) 



Define 

S_ = { x e S : Cf{x) < 0} and S+ = {x G S : Cf{x) > 0} 

is the domains where f is respectively super-harmonic or sub-harmonic. 
Then, for any 

Lit x) 

w £ T> a>f) = {avr(x) < < pn(x), x £ Z d }, 

we have the pointwise bounds: 



Z t (u) > J exp • § • £(f, /)) • exp (-/ ■{?-<*)• *(/)) , (2.19) 

and likewise, 



ZM < exp (-t • f £(f, /)) • exp (t ■ {fi - a) ■ *(/)) . (2.20) 



where f = ^/tt and 

*(/)= E /(z)V/(z,y). 



x&S+,y&S- 
x^y 



Specifically, if f is sub-harmonic, then 3>(/) = and 



mm 



max 



£ . exp (-t • f f (/, /)) < AM < J ^ • exp H • f f (/, /)) . 
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Proof. Let us start by expanding the energy £(/,/). Because / vanishes off 5, the Gauss- 
Green formula (Lemma I2.10p gives 

£(f, f) = -2J2 f{x)Cf{x) = -2 J2 f(x)Cf(x) - 2 £ f(x)Cf(x). 

xes xes+ x&S- 

Let 

B := - f(x)Cf(x) and A := £ f(x)Cf{x). 

xeS- xes+ 

So B,A> and B - A = |f (/, /). 
Also, again by Lemma 12,101 

A = -^ s+ (/,/) + £ mVf(x,y)- £ 

xeS+,yeS- x.eS + ,y<£S 
x^y x~y 



and similarly, 



S = ~£s_ (/,/)- £ f(x)Vf(x,y)+ £ /C 



xeS-,y&S+ x&S-,y£S 
x^y x^y 



Note that 



If(/,/) = If s _(/,/) + l£ s+ (/,/)+ £ ( /(y ) - + £ 



x£S+,y£S- xES,ygS 
x^jy x^jy 



Thus, since £d(I, f) > 0, we can bound 

B + A= 1 -(£ s _(fJ)-£s + (fJ))+ £ (/W + /(y))V/(x,y)+ £ /(x) 2 - £ 



xe.S+,yeS- x£S-,y£S x£S+,y£S 

x^y x^y x^y 



<l(£sMJ) + £s + (f,f))+ £ l v /(x,y)| 2 + £ (/(y) 2 -/( 

x^y x^y 

= ^£|v/(*,y)| 2 - £ (/(y) - /(*)) 2 + E (/(y) 2 -/(^) 2 

xeS+,yeS- x£S+,y£S- 
x^y x^y 

= l -£{f, /) + 2 £ /(x)V/(x, y) = l -£{f, f) + 2$(/). 



Since £> > A, this implies that for any < a < /3, 

_ pB + aA = -P±2.( B -A)- ^(A + B)> -^£(f, /)-(/?- «)$(/). (2.21) 
Similarly, 

+ pA = -^{B -A) + ^(A + B)< -|f (/, /) + (/?- «)$(/). (2.22) 
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Let oj G T> a ^. Suppose the successive states visited by u up to time t are x$, . . . ,x n , with 
the path staying a time tq, . . . ,r n at respectively at these locations. (Hence t\ + . . . +r n = t.) 
Assume oj E T> a p. If x G then the total rate at which the particle would jump out of x 
under Q is given by q(x) = /(x)" 1 Y, y ~ x f(y)- 

Then a quick calculation shows that, letting d{x) = 2d be the total rate of leaving x under 

P, 

g— d(xo)ro e -d(x„)r„ 



Q(x , x^e-i^o . . . Q(x„_i, x n )e-9(^) T " 
= 7r4 II exp{(g-d)(x)L(i,x)} 

We split this sum into a sum on S- and 5+. Using the fact that oj G fa,/?, (|2.2ip provides 
the lower bound 

^(w) > exp (t ■ {-(3B + aA)) > exp (-t ■ §£(/, /)) • exp (-t ■ (J3 - a) ■ *(/)) , 



which proves (|2.19p . Proving ()2. 19[) from ()2.22j) is similar. □ 

This section is concluded by 

Proof of Theorem \2.1[ The upper bound is a direct application of the upper bound in Lemma 
12.111 For the lower bound, using the lower bound in Lemma 12.111 we need to show that 

Q[Z>a,4»] > * = 1 " C^Texp (-c(a - l) 2 Tr(y)t • (logL)- 6 ) . 
y&S 

This follows from (|2.12|) and (|2.13|) with suitable choice of 5. □ 

3 Proof of Theorem 11.11 
3.1 Partition function estimate 

The goal of this section is to obtain the following lower bound on the partition function. 
Proposition 3.1. 

^,/3>exp(-c(logt) 5 -t 1 / 3 ) 
for some constant c depending only on (3. 

Remark 3.2. Our proof relies on Theorem \2.1\ Ofer Zeitouni has pointed out to us that 
there is another and perhaps altogether simpler route for obtaining this lower bound using the 
methods developed in [6]. We have chosen to keep the proof as it was because we believe that 
Theorem \2.1\ may prove useful for obtaining finer results than the ones we discuss here. 
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Let L be the nearest odd integer to i 1 / 3 . To prove Proposition 13.11 we introduce a square 
S of side length 2L + 1 and containing the origin as an arbitrary fixed point (e.g. , the 
center); that is, S = [~L,L] 2 n I?. 

The argument relies on an appropriate change of measure and uses the techniques de- 
veloped in the preceding section. We proceed to define the local time profile ir. For z £ S 
define 

"•(*) = p , (3.1) 



and tt(z) = for z S. Let / = y^F. Note that because 2y/a > ^Ja + 1 + \/a — 1 for a > 0, 
we have that 

£/(*) < , V z G 5. (3.2) 

That is, / is sub-harmonic. It is also not difficult to verify that tt satisfies (B1)-(B5). Let 
Q be the law of the continuous time Markov chain on S with transition rates given by 
Q(x,y) = <Jn(y)/7r(x)l{ x „ y y. 

Let K = K(L) = C(logL) for some large enough constant C (independent of L,t) to be 
chosen below. We define the event 

A := {Vz G S, L(t, z) < KTt(z)t}. (3.3) 

We also define the event B: 

B:={\dR t \ < CL(logL) 3 }. 

Finally, define the good event 

G = AC\B. (3.4) 
Proposition 13.11 will follow immediately from the following estimate: 

F[B] > F[g] > exp(-c(logt) 5 t 1/3 ), (3.5) 

since for any weB, \dRt\ < CL(log L) 3 and hence the energy satisfies H(u) > exp(— cL(log L) 3 ) > 
exp(-ct 1 / 3 (logt) 3 ). 

We will prove (|3.5|) by insisting that the random walk stay in S with a certain well-defined 
local time profile tt within S. 

Remark 3.3. Note that it is quite simple to estimate the probability to stay in the square 
S for time t, see e.g. Lemma \3. 7| later on, which gives a slightly better lower bound than 
Proposition \3.1[ However if we simply condition on staying in S, the actual boundary of the 
range might in fact be much larger than L, the boundary of the square S. This is because 
conditionally on staying in S, points near the boundary will in fact not be visited, as a quick 
calculation shows. In fact, the boundary of the range will then be typically at distance vL 
from the boundary of S, and probably is a very irregular (fractal) curve. 

We turn back to the event Q defined in (|3,4p . The following proposition says that the 
good event Q has a reasonable probability under the measure Q. 

Proposition 3.4. For L sufficiently large, Q[Q] > 1/4. 
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Proof. Using Lemma [2T9l and specifically (|2.12p . for any z,w G S, 

Qz[L(t,w) > Kir(w)t] < exp (-c(log L) 2 Tr{w)t) , 

where the constant c > can be made large by changing C in the definition of K{L). As long 
as cn(w)t > 1 for all w G S, a union bound on w G 5 gives that Q 2 L4] = 1 — 0(L 2 • e~( logZ ') ) 
as L — > oo. 

Now, we estimate E[|(9i? t |] under Q. Since the maximal degree is 4 we obtain that \dRt\ < 
4| 5 \ i? t | . Using Corollary IZ71 

E[|5\Bt|] = £Q[z £ = J>[T, > £] < J>xp (-07^3) • 
zes zes zes ^ IS;/ 

For 1 < r < L, the vertices in S of distance r from SS 1 are exactly { — (L — r), L — r} x [— (L — 
r), L — r] |J[— (L — r), L — r] x { — (L — r), L — r}, which is the square boundary of side-length 
2(L — r) + 1. This set is of size 8(L — r) (except for r = L where it is of size 1). So, with 

L-1 L-1 

x-A 



E[|5 \ Bt|] < e~ LA + 8(L - r)e~ rA = e" LA + 8e~ LA • e 

r=l x=l 

1 + 8 • J xe xk dx^j = 0(L 4 (log L) 3 • r 1 ). 



Thus, by our choice of L = i 1 / 3 , 

Q z [\dR t \ > CL(logL) 3 ] < {c'/C^h- 1 = (c'/C). 

This is at most 1/2 if C > is large enough. 
Both estimates together give 

Qz[G] >Q Z [B]-Q Z [A C } > |-o(l), 
as L — > 00. □ 
We now turn to: 

Proof of Proposition HOI By Lemma 12.111 and Proposition 13. 4i we have that 

F(G)>jeM-Kt£(f,f)). 

Thus (13, 5j) . and hence Proposition [XH will follow from the estimate £ (/, /) < clog L-L~ 2 . For 
1 < r < Llet 5 r = { — (L — r), L — r} x [— (L — r), L- r] (J[— (L — r), L — r] x {— (L — r), L — r} 
be the set of vertices which are at distance r from dS. This set is of size 8(L — r) if r < L — 1 
and 1 if r = L. For 1 < r < L, ifz~u> for z G 5V+i,?y G SV then a first order Taylor 
expansion gives 

\f(z)-f( w )\<L^.^=. 
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If z G Si and w £ So then \f( z )- f(w)\= L~ 3 / 2 . Thus, 



L-l 



£(/>/) <E E 1/W-/HI 2 

r=0 2eS r + l,t«eSr 

<L-3.gl.8(L-r-l) + L-3 4 .— + 8(L-1)L 



-3 



r=l 

2i 



< 0(L _2 logL 



□ 



3.2 Discrete planar isoperimetry 

We now state and prove a modified isoperimetric inequality which deals with the outer bound- 
ary of a set. We first need some definitions. For a set G C Z 2 , let Ext(G) be the unique 
unbounded connected component of 1? \ G. Let the outer vertex boundary d*G be defined 
by 

d*G = {x G G : 3y G Ext(G),x ~ y}. 

The outer edge boundary, denoted by d*G, consists of those edges e = (x,y) with x G G and 
y G Ext(G). 

Lemma 3.5. Let G £ Q. Let R be the smallest rectangle in 1? containing G. Then, \d*R\ < 
3|9*G|. Furthermore, \G\ < \R\ < \d*G\ 2 . 

Proof. For any connected set S, we have that \d*S\ < \d*S\ < 3|<9*»S| since adjacent to any 
vertex in d*S there are at most 3 edges in d^S (and at least one edge must connect that 
vertex to the set itself). Thus, it suffices to prove that \d*R\ < \d*G\. 

Let (x,x + e) G d*G, for some e G {(±1,0), (0, ±1)}. Then x G G and hence x G R. 
Thus, there exists a (necessarily unique) k > such that x + fee G i? and x + ze i? 
for all z > k. Thus, (x + ke,x + (fe + l)e) G d* e R. Define a map <fi : d* e G -> by 
</)((x, x + e)) = (x + fee, x + (fe + l)e) for this fe. 

We claim that cj> : d*G — > d*R is onto. This follows since if (x,x + e) G d*R, then 
considering the line L = {x — fee : fe > 0}, it must be that L n G 7^ 0, since otherwise either 
G would not be connected or R would not be the smallest rectangle containing G. Thus, there 
must exist some fe > such that x — fee G G and x — ze G for any z < k. Thus, the edge 
(x — fee, x — (fe — l)e) is in <9*G, and it is immediate that 4>(x — fee, x — (fe — l)e) = (x, x + e). 

This proves that there is a map from <9*G onto d*R. So < \d*R\ < \d*G\ < 3\d*G\. 

The second assertion follows from the fact that (fe + m) 2 > Akm. So for a fe x m rectangle 
R, \dR\ 2 =4(fe + m) 2 > 16fem = 16|i?|. □ 



3.3 Proof of condensation 

We will prove the following more precise statement of Theorem 11.11 

Theorem 3.6. Assume f3 > /3q = logo, where a is the connective constant. There exist 
constants ci,C2 > depending on (3 only such that the following holds: 



cii 1/3 (logt)- 5 < diam(i? t ) < c 2 t 1/3 (log tf > 1 - exp (-ct 1/3 (log i) 5 ) . (3.6) 
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With these preliminaries we are now ready to prove Theorem 13.61 

Proof of Theorem \3.b\ Let L be the closest integer to i 1 / 3 . Recall that by Proposition 13.11 

Z(t,/3) > exp(-ct 1 / 3 (logt) 5 ) (3.7) 

where c > is a constant depending only on j3. 

Now for k > 1, let Fk be the set of closed simple loops 7 : {0, . . . , k} — > 1? of length k 
(and thus 7(0) = 7(fc). We will abuse notations and also write 7 for the trace of 7, i.e., the 
set {7(0), . . . ,7(A; — 1)}. For 7 G Tk, we will estimate the probability that d*Rt = 7. Note 
that 

limsup|r fc | 1/fc < a, (3.8) 

k— >OD 

where a is the connective constant of Z 2 . Let /3o = log a and choose f3 > (3q. Choose also 
/3' = so that /3 < p < p. 

Let M > be large enough so that \Tj\ < e@ J for all j > M. Then, 

t 

Ht[\9*Rt\ > M] < Y, E ^^(i^) - ^^*^ = 7] 
j'=Af 7er, 
00 

< e^'-^'exp^^aogt) 5 ) 

j=M 

< (l - e -5(^-A))^ ~' • exp (ct 1 / 3 (logt) 5 -K/3- /3 )m) . 

Thus, if we take 

M>C^t 1 /3 (log0 5 5 (39) 

with C > 4, we get 

> M] < exp(-ct 1/3 (logt) 5 ). (3.10) 

Now, it is an obvious geometric fact that for any shape G C Z 2 , if R is the smallest 
rectangle that contains it then 

diam(G) < \d*R\ < 3\d*G\, (3.11) 
where the second inequality comes from Lemma 13.51 We deduce that 

Ht[diam(R t ) > M] < exp(-ct 1/3 (logt) 5 ). 
Since diam(i? t ) < M implies that Rt C [— M, M] 2 , we have that 

IH[\Rt\ > (2M + l) 2 ] < exp(-ct 1 / 3 (logt) 5 ). 
This bounds from above the external boundary size \d*Rt\, diameter diam(i?t) and volume 

\Rt\- 

For the lower bound on the diameter and boundary size we require the following estimate. 
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Lemma 3.7. Let R be a rectangle of sidelengths J and K respectively. Then if z € R, 
Vz[Rt CR}< JKexp + 3^)) < JKexp ("^) ■ 

Proof. For the purpose of this proof we may assume (by standard deviation bounds on Poisson 
random variables) that P is the law of continuous time simple random walk on Z 2 . In this 
case the coordinates Xt,Yt are independent rate (1/2) simple random walks on Z. We just 
focus on X t and compute F X (T > t) where x € {1, . . . , J} and T = inf{t > : X t £ [1, J}}. 
Let C denote the generator of (rate 1) simple random walk on Z, and let <f){x) = e lnx ^ J . It is 
trivial to check that 

Ccfr(x) = —\<j){x) 

for all x 6 Z, where A = 1 — cos(tt/J). Thus if we let ^(t, x) = e A * sin(7rx/J) we have 

^ + &p = o 

and hence Mt '■= e xt sin(irXt/ J) is a martingale. Consequently, applying the optional stopping 
time theorem at the time t AT (which is bounded), and the inequality sin(n) > (2/ir)u valid 
for < u < vr/2, yields 

sin(7nE/J) = E :c (e A 'sin(7rX MT /J)) 
>e Xt j¥ x (T>t). 

Therefore, 

P*(T > t) < Je' xt . 

Now, A = 1 — cos(7r/J) > 7r 2 /4J 2 for J large enough, and the result follows. 

The last inequality is just 2JK < J 2 + K 2 which always holds. □ 

We now combine Lemma 13.71 with (|3.1U|) to obtain a lower bound on the diameter of Rt 
under fj, t : 

Let i?f be smallest rectangle containing Rt. Let Jj, Kt be the side-lengths of R^. Lemma 
13.51 tells us that 

2{J t + K t ) - 4 = \dR^\ = \d*R^\ < Z\d*R t \. 

Note also that 

\R?\=j t K t <(^) 2 <i(^^y. 

If Jt = j, Kf = k then Rt C [x, x + k] x [y, y + j] for some |x| < k,\y\ < j. Also, by Lemma 

mi > \o*R t \ > 2(Jf+ ^~ 4 . 
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So by Lemma 13.71 f° r an integer N > 0, 

iV 

fh[Jt,K t < N] < Vt[ K t = k,J t = j] 
k,j=i 



fc,j=l x=—ky=—j 

< Z(t,P)- 1 ■ p exp (-/3 2(fc+ 3 j) ~ 4 ) • (2* + l)(2j + l)ifcexp (-0) 

< Zfe/^Cexp (-^) . = C'exp (^(logt) 8 " y • ^) • 



Thus, if 
we get that 



AT<l!tV3 (logt) -5 ) 



< AT] < exp (-ct 1 / 3 (log i) 5 
Since these are the dimensions of the smallest rectangle containing Rt, with high /if-probability 

\8Rt\> c'N and diam(i? i ) > c'N, 
for some small enough d > 0. □ 

3.4 Proof of Theorems [d and [TTil 

We explain how to adapt the arguments of the proof of Theorem 11.11 to give the proof of 
Theorem 11.41 Let T y (f3) = inf{i > : L(t,y) > f3}. The argument of Lemma 12.41 shows that 



HUP) >t)< exp(- 7 ( f } • -^—e-^) < exp(-c f ® ; 

4E y (L(r,y)) r + /3 E y (L(r,y)) 



Thus by Lemma 12.61 

P(W) > t) < exp ( /77! ;7i 



c(logL) 3 / 

which is the analogue of Corollary 12.71 Now, recall the events A in the proof of Proposition 

ED 

A = {Vz e 5, L(i, a) < Kir(z)t}; 

We modify the definition of B. Let S" = {z G 5 : dist(z, > (log L) 4 } and consider the 
event 

B' = {iz e S',L(t,z) > /?}. 
Let Q' = g' t = AC\B'. Then Q(g') > 1/4 for L sufficiently large. Indeed . Then 



Q(B' C ) < ^Q(T 2 (/3) >i) < 1/2 
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for L large enough, exactly as in the proof of Proposition 13.41 We deduce that 

P(g') > exp(-c(logt) 5 t 1 / 3 ). 

Assume that Q' t ^ 2 holds. In the remaining t/2 units of time, we make sure that the each of the 

remaining 0(L(logL) 4 ) vertices of S \ S' are visited at least (3 units of time each, as follows: 
for each 1 < k < (logL) let = {z £ S : dist(z,dS) = k}. We visit each vertex in in 
clockwise order, starting from (— L + k,0). At each new vertex, the walk remains at least (3 
and at most 2(3 units of time. When the walk has visited each vertex of S^, it moves on to 
Sfc+i. The total amount of time spend doing so is at most 2/3L(logL) 4 , which is much less 
that t/2. In the remaining amount of time, the walk is free to do what it wants, provided it 
stays in S. If all these conditions are fulfilled, it is clear that Rt = S and that each vertex 
has a local time greater than f3, so &t holds. The probability of visiting every vertex in this 
prescribed order immediately after t/2 is at least exp(— cL(log L) 4 ) for some c = c(/3) < oo. 
The probability of remaining in S after that is at least exp(—K(t/2)£(f,f)) and hence at 
least exp(— c(logi) 5 i 1/ ' 3 ). All in all, we deduce 

P(St) > exp(-c(logt) 5 t 1/3 ). (3.12) 

This argument also shows that if Z(t, f3) is the partition function corresponding to the Hamil- 
tonian H = J2 x edR t L(t,x) in (jl.4p . then 

Z(t, 0) > exp(-c(log tft 1 / 3 ). (3.13) 

Now, we claim that 

F(R t = G,£ t )<ex P {~P\dG\). (3.14) 

For each x G dG, let y be a neighbour of x such that y £ G. Consider the event J X y(t) that 
by time t there has never been a jump from x to y. On £j, x is visited at least (3 units of time. 
While at x, the rate of jumping to y is of course 1/4. Let E xy be independent exponential 
random variables with rate 1/4, which represents the amount of time a particle would have 
to wait before jumping to y. Thus J X y(t) R St C {E xy > f3}. Hence 

¥{Rt = G,£ t ) < P(n xedG J xy (t) R £t) < P(n xe9G E xy >(3) < e~^ dG ^ 

by independence of the random variables E xy . Thus (|3.14p is established. 

Putting together (|3.12p and (|3.14|) (resp. (|3.13p an d * ne definition of jl), the proof of 
Theorem 11.41 (resp. Theorem 1 1 .3 j) proceeds exactly as in Theorem 13.61 

4 Open problems and conjectures 

We now discuss several open problems. The most basic ones are the following. 
Problem 1. Does Theorem \1.1\ stay true ifO<f3<(3o? 

It is possible to imagine that there is a phase transition from subdiffusive to diffusive as 
f3 decreases to 0. However simulations suggest that no such phase transition occurs and that 
the result should remain true even if f3 < (3q. 

Our next question is to ask whether logarithmic factors in Theorem 1 1.1 1 are really present. 
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Problem 2. Find f(t) such that diam(Rt) / f (t) converges to 1 in ^-probability as t —> oo. 



Another immediate question is to obtain the limiting shape of the random walk: 
Problem 3. Find a set S = S(/3) C R 2 such that 

Fit 

inf — — ; z + S) -> 

zm 2 diam(i2t) 

in probability, where du aus stands for Hausdorff distance. 

The reason for looking for sets of the form z + S to approximate the normalised range of 
the walk, is that we believe that with this formulation S can be chosen to be nonrandom: 
that is, a certain random translation of the deterministic shape S gives a good approximation 
of the normalised range. 

Once the existence of S is established one may ask numerous questions about its geometry. 
For instance, does it have any flat facet? We conjecture in fact that as j3 — > oo, S(/3) 
converges in the Hausdorff sense to a diamond of unit diameter. Indeed it is easy to see that 
the diamond is the minimiser of the isoperimetric problem for the vertex-boundary: i.e., if 
d v S = {x G S : 3y ^ S, x ~ y}, then 

min \d v S\ 
\S\=v* 

is attained for a diamond S = {x, y : \x\ + \y\ < n}, whenever v = 2n(n + 1). 

It is interesting to compare this situation for the case of a random walk conditioned on 
{Lt(x) > P, Vx G Rt}- There again, we believe a similar shape theorem holds, but the limiting 
shape of S(f3) as /3 — > oo should instead be a square with unit diameter. This is because, 
(13. 141) in fact shows that 

F[Rt = G,£ t ] <exp(-/3|a e G|) 

where d e G denotes the edge boundary of a graph G. Thus, when /3 — > oo, it is reasonable 
to guess that S((3) should minimise its edge boundary, rather than its vertex boundary , and 
hence be a square rather than a diamond. 

Typical fluctuations of the shape (in the case where /3 > (3q say) raise intriguing ques- 
tions. In the percolation construction of the Wulff crystal when /3 is large but finite, these 
fluctuations are known with considerable precision. For instance, the so-called maximal local 
roughness, which measures the maximal distance from a point on the boundary of the shape 
to the polygonal hull of that shape, is of order n 1 / 3 if the diameter of the Wulff shape is of 
order n: see [23], pQ; more recently an extremely precise result in this direction was recently 
established by Hammond |18j . These exponents are identical to those arising in the Kardar- 
Parisi-Zhang (KPZ) universality class. While the Wulff crystal is not believed to belong to 
this universality class, there is nevertheless an analogy between the two situations, a fact 
which guides the intuition in the approach of [18J. 

Problem 4. What are the maximum local roughness and facet length of the shape Rt when 
t ^ oo? 

We speculate that the exponent for the fluctuations take the value in agreement with the 
Wulff crystal (and hence also with KPZ): this suggests that the maximum facet length is of 
order i 2 / 9 and the maximum local roughness of order i 1 / 9 . 
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Problem 5. What is the diameter diam(i?t) when d>3? 

We believe that, at least for /3 sufficiently large, the diameter is essentially given by the 
following formula: 

diam( J R t ) (t/f3)d+T (4.1) 

with high probability, possibly up to logarithmic factors. To see where this comes from, note 
that in d dimensions, the probability that a random walk stays in a box of side length L is 
approximately exp(— ct/L 2 ). The value of the energy should then be not too different from 
f3L d ^ 1 . Equating 

yields the value in the right hand side of (|4.ip . 

Naturally, what makes this problem more difficult in three dimensions than two is that 
the topology of the boundary is much more complicated. For instance, the shape could 
conceivably have a few thin protuberances ("hairs") sticking out of a globally compact shape. 
Though each such occurrence increases the energy significantly and so should be penalised, 
the size of the ambient space available means that there are a lot of opportunities to do so. 
Such difficulty was already present in the study of the Wulff crystal for the Ising or percolation 
models (see e.g. [2]). 
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